Fix: missing locale, false locale regex matches, related bugs #60

Merged
dacharyc merged 1 commit into main from fix/missing-locale-prefix-handling on Apr 25, 2026

Conversation

@dacharyc
Member

Issue #56 reported three problems:

  • Locale filtering failure when llms.txt has no locale segment
  • Two-locale sites failing the detection threshold
  • Possible false regex matches for non-locale segments

This PR adds a set of ISO 639-1 language codes to match against explicitly, plus some heuristics to avoid false matches on topic segments that happen to coincide with language codes. It also fixes a handful of locale-related bugs, and it should resolve the cases identified in #56. 🤞
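
For readers who have not looked at the diff, here is a rough sketch of the general idea in Python. It is not the code from this PR. The names (`ISO_639_1`, `LOCALE_SHAPE`, `detect_locales`, `filter_by_locale`), the two-distinct-codes threshold, and the choice to check only the first path segment are all assumptions made for illustration. The code set is a small subset of ISO 639-1, picked to include `ts`, `ml`, and `cs`, which are valid language codes (Tsonga, Malayalam, Czech) but also plausible topic segments.

```python
import re
from urllib.parse import urlparse

# Illustrative subset of ISO 639-1 codes (assumption: a real list is much longer).
ISO_639_1 = {"en", "es", "fr", "de", "ja", "zh", "pt", "it", "ko", "ru",
             "ts", "ml", "cs"}

# Hypothetical locale shape: two-letter language, optional two-letter region
# (for example "pt-br" or "zh_cn").
LOCALE_SHAPE = re.compile(r"^([a-z]{2})(?:[-_]([a-z]{2}))?$", re.IGNORECASE)


def locale_candidate(segment):
    """Return the lowercased segment if it looks like a locale, else None."""
    match = LOCALE_SHAPE.match(segment)
    if match and match.group(1).lower() in ISO_639_1:
        return segment.lower()
    return None


def first_segment(url):
    """Return the first non-empty path segment of a URL, or None."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    return parts[0] if parts else None


def detect_locales(urls, min_distinct=2):
    """Collect locale-like first segments across all URLs.

    Assumed heuristic: a lone code such as "ts" is more likely a topic than a
    locale, so the site only counts as localized when at least `min_distinct`
    different codes appear. Two is enough, so two-locale sites still qualify.
    """
    seen = set()
    for url in urls:
        segment = first_segment(url)
        candidate = locale_candidate(segment) if segment else None
        if candidate:
            seen.add(candidate)
    return seen if len(seen) >= min_distinct else set()


def filter_by_locale(urls, preferred="en"):
    """Keep URLs in the preferred locale plus URLs with no locale segment."""
    if not detect_locales(urls):
        return list(urls)  # no locale structure detected: keep everything
    kept = []
    for url in urls:
        segment = first_segment(url)
        candidate = locale_candidate(segment) if segment else None
        if candidate is None or candidate == preferred:
            kept.append(url)
    return kept
```

Under these assumptions, a site with `/ts/handbook` and `/guides/intro` is left unfiltered because only one code-like segment appears, while a site with `/en/intro` and `/fr/intro` is recognized as localized and reduced to the `en` page. URLs with no locale segment are always kept, which is one way to handle the missing-locale case.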

@dacharyc merged commit b0dce29 into main Apr 25, 2026
2 checks passed
@dacharyc deleted the fix/missing-locale-prefix-handling branch April 25, 2026 14:59